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SEQUENCE LISTING 

SEQ ID N0:1 

EB0GP1 /MBGGP2 nucleotide sequence 

10 20 30 40 50 

atgggcgtta caggaatatt gcagttacct cgtgatcgat tcaagaggac 

60 70 80 90 100 

atcattcttt ctttgggtaa ttatcctttt ccaaagaaca ttttccatcc 

110 120 130 140 150 

cacttggagt catccacaat agcacattac aggttagtga tgtcgacaaa 

160 170 180 190 200 

ctagtttgtc gtgacaaact gtcatccaca aatcaattga gatcagttgg 

210 220 230 240 250 

actgaatctc gaagggaatg gagtggcaac tgacgtgcca tctgcaacta 

260 270 280 290 300 

aaagatgggg cttcaggtcc ggtgtcccac caaaggtggt caattatgaa 

310 320 330 340 350 

gctggtgaat gggctgaaaa ctgctacaat cttgaaatca aaaaacctga 

360 370 380 390 400 

cgggagtgag tgtctaccag cagcgccaga cgggattcgg ggcttccccc 

410 420 430 440 450 

ggtgccggta tgtgcacaaa gtatcaggaa cgggaccgtg tgccggagac 

460 470 480 490 500 

tttgccttcc ataaagaggg tgctttcttc ctgtatgatc gacttgcttc 

510 520 530 540 550 

cacagttatc taccgaggaa cgactttcgc tgaaggtgtc gttgcatttc 

560 570 580 590 600 

tgatactgcc ccaagctaag aaggacttct tcagctcaca ccccttgaga 

610 620 630 640 650 

gagccggtca atgcaacgga ggacccgtct agtggctact attctaccac 

660 670 680 690 700 

aattagatat caggctaccg gttttggaac caatgagaca gagtacttgt 

710 720 730 740 750 

tcgaggttga caatttgacc tacgtccaac ttgaatcaag attcacacca 



760 770 780 790 800 

cagtttctgc tccagctgaa tgagacaata tatacaagtg ggaaaaggag 
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810 820 830 840 850 

caataccacg ggaaaactaa tttggaaggt caaccccgaa attgatacaa 

860 870 880 890 900 

caatcgggga gtgggccttc tgggaaacta aaaaaaacct cactagaaaa 

910 920 930 940 950 

attcgcagtg aagagttgtc tttcacagtt gtatcaaacg gagccaaaaa 

960 970 980 990 1000 

catcagtggt cagagtccgg cgcgaacttc ttccgaccca gggaccaaca 

1010 1020 1030 1040 1050 

caacaactga agaccacaaa atcatggctt cagaaaattc ctctgcaatg 

1060 1070 1080 1090 1100 

gttcaagtgc acagtcaagg aagggaagct gcagtgtcgc atctaacaac 

1110 1120 1130 1140 1150 

ccttgccaca atctccacga gtccccaatc cctcacaacc aaaccaggtc 

1160 1170 1180 1190 1200 

cggacaacag cacccataat acacccgtgt ataaacttga catctctgag 

1210 1220 1230 1240 1250 

gcaactcaag ttgaacaaca tcaccgcaga acagacaacg acagcacagc 

1260 1270 1280 1290 1300 

ctccgacact ccctctgcca cgaccgcagc cggaccccca aaagcagaga 

1310 1320 1330 1340 1350 

acaccaacac gagcaagagc actgacttcc tggaccccgc caccacaaca 

1360 1370 1380 1390 1400 

agtccccaaa accacagcga gaccgctggc aacaacaaca ctcatcacca 

1410 1420 1430 1440 1450 

agataccgga gaagagagtg ccagcagcgg gaagctaggc ttaattacca 

1460 1470 1480 1490 1500 

atactattgc tggagtcgca ggactgatca caggcgggag aagaactcga 

1510 1520 1530 1540 1550 

cgatcgatcc tctggaggga aggcgacatg ttcccttttc tggatgggtt 

1560 1570 1580 1590 1600 

aataaatgct ccaattgatt ttgacccagt tccaaataca aaaacaatct 

1610 1620 1630 1640 1650 

ttgatgaatc ctctagttct ggtgcctcgg ctgaggaaga tcaacatgcc 

1660 1670 1680 1690 1700 

tcccccaata ttagtttaac tttatcttat tttcctaata taaatgagaa 
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1710 


1720 


1730 


1740 


1750 


cactgcctac 


tctggagaaa 


atgagaatga 


ttgtgatgca 


gagttaagaa 


1760 


1770 


1780 


1790 


1800 


tttggagcgt 


tcaggaggat 


gacctggccg 


cagggctcag 


ttggataccg 


1810 


1820 


1830 


1840 


1850 


ttttttggcc 


ctggaattga 


aggactttac 


actgctgttt 


taattaaaaa 


1860 


1870 


1880 


1890 


1900 


tcaaaacaat 


ttggtctgca 


ggttgaggcg 


tctagccaat 


caaactgcca 


1910 


1920 


1930 


1940 


1950 


aatccttgga 


actcttattg 


agagtcacaa 


ctgaggaaag 


aacattctcc 


1960 


1970 


1980 


1990 


2000 


ttaatcaata 


gacatgctat 


tgactttcta 


ctcacaagat 


ggggaggaac 


2010 


2020 


2030 


2040 


2050 


atgcaaagtg 


cttggacctg 


attgttgcat 


cgggatagaa 


gacttgtcca 


2060 


2070 


2080 


2090 


2100 


aaaatatttc 


agagcaaatt 


gaccaaatta 


aaaaggacga 


acaaaaagag 


2110 


2120 


2130 


2140 


2150 


gggactggtt 


ggggtctggg 


tggtaaatgg 


tggacatccg 


actggggtgt 


2160 


2170 


2180 


2190 


2200 


tcttactaac 


ttgggcattt 


tgctactatt 


atccatagct 


gtcttgattg 


2210 


2220 


2230 


2240 


2250 


ctctatcctg 


tatttgtcgt 


atctttacta 


aatatatcgg 


ataacggaat 



tc 

SEQ ID NO: 2 

EBOGPl /MBGGP2 amino acid sequence 

MetGlyValThrGlylleLeuGlnLeuProArgAspArgPheLys 

ArgThrSerPhePheLeuTrpValllelleLeuPheGlnArgThrPhe 

SerlleProLeuGlyVallleHisAsnSerThrLeuGlnValSerAsp 

ValAspLysLeuValCysArgAspLysLeuSerSerThrAsnGlnLeuArg 

SerValGlyLeuAsnLeuGluGlyAsnGlyValAlaThrAspValProSer 

AlaThrLysArgTrpGlyPheArgSerGlyValProProLysValVal 

AsnTyrGluAlaGlyGluTrpAlaGluAsnCysTyrAsnLeuGluIleLys 

LysProAspGlySerGluCysLeuProAlaAlaProAspGlylleArgGly 
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PheProArgCysArgTyrValHisLysValSerGlyThrGlyProCys 

AlaGlyAspPheAlaPheHisLysGluGlyAlaPhePheLeuTyrAspArg 

LeuAlaSerThrVallleTyrArgGlyThrThrPheAlaGluGlyValVal 

AlaPheLeuIleLeuProGlnAlaLysLysAspPhePheSerSerHis 

ProLeuArgGluProValAsnAlaThrGluAspProSerSerGlyTyrTyr 

SerThrThrlleArgTyrGlnAlaThrGlyPheGlyThrAsnGluThrGlu 

TyrLeuPheGluValAspAsnLeuThrTyrValGlnLeuGluSerArg 

PheThrProGlnPheLeuLeuGlnLeuAsnGluThrlleTyrThrSerGly 

LysArgSerAsnThrThrGlyLysLeuIleTrpLysValAsnProGluIle 

AspThrThrlleGlyGluTrpAlaPheTrpGluThrLysLysAsnLeu 

ThrArgLysIleArgSerGluGluLeuSerPheThrValValSerAsnGly 

AlaLysAsnlleSerGlyGlnSerProAlaArgThrSerSerAspProGly 

ThrAsnThrThrThrGluAspHisLysIleMetAlaSerGluAsnSer 

SerAlaMetValGlnValHisSerGlnGlyArgGluAlaAlaValSerHis 

LeuThrThrLeuAlaThrlleSerThrSerProGlnSerLeuThrThrLys 

ProGlyProAspAsnSerThrHisAsnThrProValTyrLysLeuAsp 

IleSerGluAlaThrGlnValGluGlnHisHisArgArgThrAspAsnAsp 

SerThrAlaSerAspThrProSerAlaThrThrAlaAlaGlyProProLys 

AlaGluAsnThrAsnThrSerLysSerThrAspPheLeuAspProAla 

ThrThrThrSerProGlnAsnHisSerGluThrAlaGlyAsnAsnAsnThr 

HisHisGlnAspThrGlyGluGluSerAlaSerSerGlyLysLeuGlyLeu 

IleThrAsnThrlleAlaGlyValAlaGlyLeuIleThrGlyGlyArg 

ArgThrArgArgSerlleLeuTrpArgGluGlyAspMetPheProPheLeu 

AspGlyLeuIleAsnAlaProIleAspPheAspProValProAsnThrLys 

ThrllePheAspGluSerSerSerSerGlyAlaSerAlaGluGluAsp 

GlnHisAlaSerProAsnlleSerLeuThrLeuSerTyrPheProAsnlle 

AsnGluAsnThrAlaTyrSerGlyGluAsnGluAsnAspCysAspAlaGlu 
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LeuArglleTrpSerValGlnGluAspAspLeuAlaAlaGlyLeuSer 

TrpIleProPhePheGlyProGlylleGluGlyLeuTyrThrAlaValLeu 

IleLysAsnGlnAsnAsnLeuValCysArgLeuArgArgLeuAlaAsnGln 

ThrAlaLysSerLeuGluLeuLeuLeuArgValThrThrGluGluArg 

ThrPheSerLeuIleAsnArgHisAlalleAspPheLeuLeuThrArgTrp 

GlyGlyThrCysLysValLeuGlyProAspCysCysIleGlylleGluAsp 

LeuSerLysAsnlleSerGluGlnlleAspGlnlleLysLysAspGlu 

GlnLysGluGlyThrGlyTrpGlyLeuGlyGlyLysTrpTrpThrSerAsp 

TrpGlyValLeuThrAsnLeuGlylleLeuLeuLeuLeuSerlleAlaVal 

LeuIleAlaLeuSerCysIleCysArgllePheThrLysTyrlleGly* 

SEQ ID NO: 3 

MBGGP1 / EBOGP2 nucleotide sequence 



10 


20 


30 


40 


50 


atgaagacca 


catgtttcct 


tatcagtctt 


atcttaattc 


aagggacaaa 


60 


70 


80 


90 


100 


aaatctcccc 


attttagaga 


tagctagtaa 


taatcaaccc 


caaaatgtgg 


110 


120 


130 


140 


150 


attcggtatg 


ctccggaact 


ctccagaaga 


cagaagacgt 


ccatctgatg 


160 


170 


180 


190 


200 


ggattcacac 


tgagtgggca 


aaaagttgct 


gattcccctt 


tggaggcatc 


210 


220 


230 


240 


250 


caagcgatgg 


gctttcagga 


caggtgtacc 


tcccaagaat 


gttgagtaca 


260 


270 


280 


290 


300 


cagaggggga 


ggaagccaaa 


acatgctaca 


atataagtgt 


aacggatccc 


310 


320 


330 


340 


350 


tctggaaaat 


ccttgctgtt 


agatcctcct 


accaacatcc 


gtgactatcc 


360 


370 


380 


390 


400 


gaaatgcaaa 


actatccatc 


atattcaagg 


tcaaaaccct 


catgcacagg 


410 


420 


430 


440 


450 


ggatcgccct 


tcatttatgg 


ggagcatttt 


ttctgtatga 


tcgcattgcc 


460 


470 


480 


490 


500 


tccacaacaa 


tgtaccgagg 


caaagtcttc 


actgaaggga 


acatagcagc 
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510 520 530 540 550 

tatgattgtc aataagacag tgcacaaaat gattttctcg cggcaaggac 

560 570 580 590 600 

aagggtaccg tcatatgaat ctgacttcta ctaataaata ttggacaagt 

610 620 630 640 650 

agtaacggaa cgcaaacgaa tgacactgga tgtttcggcg ctcttcaaga 

660 670 680 690 700 

atacaattct acaaagaacc aaacatgtgc tccgtccaaa atacctccac 

710 720 730 740 750 

cactgcccac agcccgtccg gagatcaaac tcacaagcac cccaactgat 

760 770 780 790 800 

gccaccaaac tcaataccac ggacccaagc agtgatgatg aggacctcgc 

810 820 830 840 850 

aacatccggc tcagggtccg gagaacgaga accccacaca acttctgatg 

860 870 880 890 900 

cggtcaccaa gcaagggctt tcatcaacaa tgccacccac tccctcacca 

910 920 930 940 950 

caaccaagca cgccacagca aggaggaaac aacacaaacc attcccaaga 

960 970 980 990 1000 

tgctgtgact gaactagaca aaaataacac aactgcacaa ccgtccatgc 

1010 1020 1030 1040 1050 

cccctcataa cactaccaca atctctacta acaacacctc caaacacaac 

1060 1070 1080 1090 1100 

ttcagcactc tctctgcacc attacaaaac accaccaatg acaacacaca 

1110 1120 1130 1140 1150 

gagcacaatc actgaaaatg agcaaaccag tgccccctcg ataacaaccc 

1160 1170 1180 1190 1200 

tgcctccaac gggaaatccc accacagcaa agagcaccag cagcaaaaaa 

1210 1220 1230 1240 1250 

ggccccgcca caacggcacc aaacacgaca aatgagcatt tcaccagtcc 

1260 1270 1280 1290 1300 

tccccccacc cccagctcga ctgcacaaca tcttgtatat ttcagaagaa 

1310 1320 1330 1340 1350 

agcgatcggc aattgtcaat gctcaaccca aatgcaaccc taatttacat 

1360 1370 1380 1390 1400 

tactggacta ctcaggatga aggtgctgca atcggactgg cctggatacc 
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1410 


1420 


1430 


1440 


1450 




^LaytayuLy 


a r~r it f~r ^ a t* i" f 3 


v-ci tay ciyyyy 




1460 


1470 


1480 


1490 


1500 


a. Uv-.cia.y ct uy y 


LLLaaLLLyL 


yyyuuyciycic 


ciycuyyccaa 


cgagacgac l 


1510 


1520 


1530 


1540 


1550 


CaayCCCLCC 


r^s rrs /-^ 4* 4~ 4- /-^ +- 

aaCCyttCCC 


gagagccaca 


act gage tac 


gcacct t t tc 


1560 


1570 


1580 


1590 


1600 


aaCCCtCaaC 


/■"« /~y |- a a rrr**~< sa a 

cgLaayyCaa 


LLyaLLtCLL 


yCLyCayCya 


t-ggggeggea 


1610 


1620 


1630 


1640 


1650 


CaCgCCaCdt 


LCLygyaCcg 


gac cgctgca 


tcgaaccaca 


tgat tggacc 


1660 


1670 


1680 


1690 


1700 


— s -j -a -a /-« —a -a -a 

aaydaCauda 


cagacaaaa l 


cga ucagauc 


a l l c a l. ga u l. 


ccgccgataa 


1710 


1720 


1730 


1740 


1750 


aacccttccg 


gaccaggggg 


acaatgacaa 


ttggtggaca 


ggatggagac 


1760 


1770 


1780 


1790 


1800 


aatggatacc 


ggcaggtatt 


ggagttacag 


gcgttataat 


tgcagttatc 


1810 


1820 


1830 


1840 




gctttattct 


gtatatgcaa 


atttgtcttt 


tagttgaatt 


c 



SEQ ID NO: 4 

MBG1/EB02 amino acid sequence 

MetLysThrThrCysPheLeuIleSerLeuIleLeuIleGlnGlyThr 

LysAsnLeuProIleLeuGluIleAlaSerAsnAsnGlnProGln 

AsnValAspSerValCysSerGlyThrLeuGlnLysThrGluAspVal 

HisLeuMetGlyPheThrLeuSerGlyGlnLysValAlaAspSerProLeu 

GluAlaSerLysArgTrpAlaPheArgThrGlyValProProLysAsn 

ValGluTyrThrGluGlyGluGluAlaLysThrCysTyrAsnlleSerVal 

ThrAspProSerGlyLysSerLeuLeuLeuAspProProThrAsnlleArg 

AspTyrProLysCysLysThrlleHisHisIleGlnGlyGlnAsnPro 

HisAlaGlnGlylleAlaLeuHisLeuTrpGlyAlaPhePheLeuTyrAsp 

ArglleAlaSerThrThrMetTyrArgGlyLysValPheThrGluGlyAsn 

IleAlaAlaMetlleValAsnLysThrValHisLysMetllePheSer 
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ArgGlnGlyGlnGlyTyrArgHisMetAsnLeuThrSerThrAsnLysTyr 

TrpThrSerSerAsnGlyThrGlnThrAsnAspThrGlyCysPheGlyAla 

LeuGlnGluTyrAsnSerThrLysAsnGlnThrCysAlaProSerLys 

IleProProProLeuProThrAlaArgProGluIleLysLeuThrSerThr 

ProThrAspAlaThrLysLeuAsnThrThrAspProSerSerAspAspGlu 

AspLeuAlaThrSerGlySerGlySerGlyGluArgGluProHisThr 

ThrSerAspAlaValThrLysGlnGlyLeuSerSerThrMetProProThr 

ProSerProGlnProSerThrProGlnGlnGlyGlyAsnAsnThrAsnHis 

SerGlnAspAlaValThrGluLeuAspLysAsnAsnThrThrAlaGln 

ProSerMetProProHisAsnThrThrThrlleSerThrAsnAsnThrSer 

LysHisAsnPheSerThrLeuSerAlaProLeuGlnAsnThrThrAsnAsp 

AsnThrGlnSerThrlleThrGluAsnGluGlnThrSerAlaProSer 

IleThrThrLeuProProThrGlyAsnProThrThrAlaLysSerThrSer 

SerLysLysGlyProAlaThrThrAlaProAsnThrThrAsnGluHisPhe 

ThrSerProProProThrProSerSerThrAlaGlnHisLeuValTyr 

PheArgArgLysArgSerAlalleValAsnAlaGlnProLysCysAsnPro 

AsnLeuHisTyrTrpThrThrGlnAspGluGlyAlaAlalleGlyLeuAla 

TrpIleProTyrPheGlyProAlaAlaGluGlylleTyrlleGluGly 

LeuMetHisAsnGlnAspGlyLeuIleCysGlyLexiArgGlnLeuAlaAsn 

GluThrThrGlnAlaLeuGlnLeuPheLeuArgAlaThrThrGluLeuArg 

ThrPheSerlleLeuAsnArgLysAlalleAspPheLeuLeuGlnArg 

TrpGlyGlyThrCysHisIleLeuGlyProAspCysCysIleGluProHis 

AspTrpThrLysAsnlleThrAspLysIleAspGlnllelleHisAspPhe 

ValAspLysThrLeuProAspGlnGlyAspAsnAspAsnTrpTrpThr 

GlyTrpArgGlnTrpIleProAlaGlylleGlyValThrGlyValllelle 

AlaVallleAlaLeuPheCysIleCysLysPheValPhe* 
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SEQ ID NO: 5 

MUSGP1/RVNGP2 nucleotide sequence 

Sequence Range: 1 to 2046 

10 20 30 40 50 

atgaagacca catgtttcct tatcagtctt atcttaattc aagggacaaa 

60 70 80 90 100 

aaatctcccc attttagaga tagctagtaa taatcaaccc caaaatgtgg 

110 120 130 140 150 

attcggtatg ctccggaact ctccagaaga cagaagacgt ccatctgatg 

160 170 180 190 200 

ggattcacac tgagtgggca aaaagttgct gattcccctt tggaggcatc 

210 220 230 240 250 

caagcgatgg gctttcagga caggtgtacc tcccaagaat gttgagtaca 

260 270 280 290 300 

cagaggggga ggaagccaaa acatgctaca atataagtgt aacggatccc 

310 320 330 340 350 

tctggaaaat ccttgctgtt agatcctcct accaacatcc gtgactatcc 

360 370 380 390 400 

gaaatgcaaa actatccatc atattcaagg tcaaaaccct catgcacagg 

410 420 430 440 450 

ggatcgccct tcatttatgg ggagcatttt ttctgtatga tcgcattgcc 

460 470 480 490 500 

tccacaacaa tgtaccgagg caaagtcttc actgaaggga acatagcagc 

510 520 530 540 550 

tatgattgtc aataagacag tgcacaaaat gattttctcg cggcaaggac 

560 570 580 590 600 

aagggtaccg tcatatgaat ctgacttcta ctaataaata ttggacaagt 

610 620 630 640 650 

agtaacggaa cgcaaacgaa tgacactgga tgtttcggcg ctcttcaaga 

660 670 680 690 700 

atacaattct acaaagaacc aaacatgtgc tccgtccaaa atacctccac 

710 720 730 740 750 

cactgcccac agcccgtccg gagatcaaac tcacaagcac cccaactgat 

760 770 780 790 800 

gccaccaaac tcaataccac ggacccaagc agtgatgatg aggacctcgc 

810 820 830 840 850 
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aacatccggc tcagggtccg gagaacgaga accccacaca acttctgatg 

860 870 880 890 900 

cggtcaccaa gcaagggctt tcatcaacaa tgccacccac tccctcacca 

910 920 930 940 950 

caaccaagca cgccacagca aggaggaaac aacacaaacc attcccaaga 

960 970 980 990 1000 

tgctgtgact gaactagaca aaaataacac aactgcacaa ccgtccatgc 

1010 1020 1030 1040 1050 

cccctcataa cactaccaca atctctacta acaacacctc caaacacaac 

1060 1070 1080 1090 1100 

ttcagcactc tctctgcacc attacaaaac accaccaatg acaacacaca 

1110 1120 1130 1140 1150 

gagcacaatc actgaaaatg agcaaaccag tgccccctcg ataacaaccc 

1160 1170 1180 1190 1200 

tgcctccaac gggaaatccc accacagcaa agagcaccag cagcaaaaaa 

1210 1220 1230 1240 1250 

ggccccgcca caacggcacc aaacacgaca aatgagcatt tcaccagtcc 

1260 1270 1280 1290 1300 

tccccccacc cccagctcga ctgcacaaca tcttgtatat ttcagaagaa 

1310 1320 1330 1340 1350 

agcgatcgat tttctggaaa gaaggtgata tattcccgtt tttagatggg 

1360 1370 1380 1390 1400 

ttaataaata ctgaaattga ttttgatcca atcccaaaca cagaaacaat 

1410 1420 1430 1440 1450 

ctttgatgaa tctcccagct ttaatacttc aactaatgag gaacaacaca 

1460 1470 1480 1490 1500 

ctcccccgaa tatcagttta actttctctt attttcctga taaaaatgga 

1510 1520 1530 1540 1550 

gatactgcct actctgggga aaacgagaat gattgtgatg cagagttgag 

1560 1570 1580 1590 1600 

gatttggagt gtgcaggagg acgatttggc ggcagggctt agctggatac 

1610 1620 1630 1640 1650 

cattttttgg ccctggaatc gaaggactct atactgccgg tttaatcaaa 

1660 1670 1680 1690 1700 

aatcagaaca atttagtttg taggttgagg cgcttagcta atcaaactgc 

1710 1720 1730 1740 1750 
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taaatccttg 


gagctcttgt 


taagggtcac 


aaccgaggaa 


aggacatttt 


1760 


1770 


1780 


1790 


1800 


ccttaatcaa 


taggcatgca 


attgactttt 


tgcttacgag 


gtggggcgga 


1810 


1820 


1830 


1840 


1850 


acatgcaagg 


tgctaggacc 


tgattgttgc 


ataggaatag 


aagatctatc 


1860 


1870 


1880 


1890 


1900 


taaaaatatc 


tcagaacaaa 


tcgacaaaat 


cagaaaggat 


gaacaaaagg 


1910 


1920 


1930 


1940 


1950 


aggaaactgg 


ctggggtcta 


ggtggcaaat 


ggtggacatc 


tgactggggt 


1960 


1970 


1980 


1990 


2000 


gttctcacca 


atttgggcat 


cctgctacta 


ttatctatag 


ctgttctgat 


2010 


2020 


2030 


2040 




tgctctgtcc 


tgtatctgtc 


gtatcttcac 


taaatacatt 


ggatga 



SEQ ID NO: 6 

MUSGPl /RVNGP2 amino acid sequence 

MetLysThrThrCysPheLeuIleSerLeuIleLeuIleGlnGlyThr 

LysAsnLeuProIleLeuGluIleAlaSerAsnAsnGlnProGln 

AsnValAspSerValCysSerGlyThrLeuGlnLysThrGluAspVal 

HisLeuMetGlyPheThrLeuSerGlyGlnLysValAlaAspSerProLeu 

GluAlaSerLysArgTrpAlaPheArgThrGlyValProProLysAsn 

ValGluTyrThrGluGlyGluGluAlaLysThrCysTyrAsnlleSerVal 

ThrAspProSerGlyLy s SerLeuLeuLeuAspProProThrAsnl leArg 

AspTyrProLysCysLysThrlleHisHisIleGlnGlyGlnAsnPro 

HisAlaGlnGlylleAlaLeuHisLeuTrpGlyAlaPhePheLeuTyrAsp 

ArglleAlaSerThrThrMetTyrArgGlyLysValPheThrGluGlyAsn 

IleAlaAlaMetlleValAsnLysThrValHisLysMetllePheSer 

ArgGlnGlyGlnGlyTyrArgHisMetAsnLeuThrSerThrAsnLysTyr 

TrpThrSerSerAsnGlyThrGlnThrAsnAspThrGlyCysPheGlyAla 

LeuGlnGluTyrAsnSerThrLysAsnGlnThrCysAlaProSerLys 

IleProProProLeuProThrAlaArgProGluIleLysLeuThrSerThr 
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ProThrAspAlaThrLysLeuAsnThrThrAspProSerSerAspAspGlu 

AspLeuAlaThrSerGlySerGlySerGlyGluArgGluProHisThr 

ThrSerAspAlaValThrLysGlnGlyLeuSerSerThrMetProProThr 

ProSerProGlnProSerThrProGlnGlnGlyGlyAsnAsnThrAsnHis 

SerGlnAspAlaValThrGluLeuAspLysAsnAsnThrThrAlaGln 

ProSerMetProProHisAsnThrThrThrlleSerThrAsnAsnThrSer 

LysHisAsnPheSerThrLeuSerAlaProLeuGlnAsnThrThrAsnAsp 

AsnThrGlnSerThrlleThrGluAsnGluGlnThrSerAlaProSer 

IleThrThrLeuProProThrGlyAsnProThrThrAlaLysSerThrSer 

SerLysLysGlyProAlaThrThrAlaProAsnThrThrAsnGluHisPhe 

ThrSerProProProThrProSerSerThrAlaGlnHisLeuValTyr 

PheArgArgLysArgSerllePheTrpLysGluGlyAspIlePheProPhe 

LeuAspGlyLeuIleAsnThrGluIleAspPheAspProIleProAsnThr 

GluThrllePheAspGluSerProSerPheAsnThrSerThrAsnGlu 

GluGlnHisThrProProAsnlleSerLeuThrPheSerTyrPheProAsp 

LysAsnGlyAspThrAlaTyrSerGlyGluAsnGluAsnAspCysAspAla 

GluLeuArglleTrpSerValGlnGluAspAspLeuAlaAlaGlyLeu 

SerTrpIleProPhePheGlyProGlylleGluGlyLeuTyrThrAlaGly 

LeuIleLysAsnGlnAsnAsnLeuValCysArgLeuArgArgLeuAlaAsn 

GlnThrAlaLysSerLeuGluLeuLeuLeuArgValThrThrGluGlu 

ArgThrPheSerLeuIleAsnArgHisAlalleAspPheLeuLeuThrArg 

TrpGlyGlyThrCysLysValLeuGlyProAspCysCysIleGlylleGlu 

AspLeuSerLysAsnlleSerGluGlnlleAspLysIleArgLysAsp 

GluGlnLysGluGluThrGlyTrpGlyLeuGlyGlyLysTrpTrpThrSer 

AspTrpGlyValLeuThrAsnLeuGlylleLeuLeuLeuLeuSerlleAla 

ValLeuIleAlaLeuSerCysIleCysArgllePheThrLysTyrlleGly* 
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SEQ ID NO: 7 

RVNGP1 /MUSGP2 nucleotide sequence 

10 20 30 40 50 

atgaagacca tatattttct gattagtctc attttaatcc aaagtataaa 

60 70 80 90 100 

aactctccct gttttagaaa ttgctagtaa cagccaacct caagatgtag 

110 120 130 140 150 

attcagtgtg ctccggaacc ctccaaaaga cagaagatgt tcatctgatg 

160 170 180 190 200 

ggatttacac tgagtgggca aaaagttgct gattcccctt tggaagcatc 

210 220 230 240 250 

taaacgatgg gctttcagga caggtgttcc tcccaagaac gttgagtata 

260 270 280 290 300 

cggaaggaga agaagccaaa acatgttaca atataagtgt aacagaccct 

310 320 330 340 350 

tctggaaaat ccttgctgct ggatcctccc agtaatatcc gcgattaccc 

360 370 380 390 400 

taaatgtaaa actgttcatc atattcaagg tcaaaaccct catgcacagg 

410 420 430 440 450 

ggattgccct ccatttgtgg ggggcatttt tcttgtatga tcgcgttgcc 

460 470 480 490 500 

tctacaacaa tgtaccgagg caaggtcttc actgaaggaa atatagcagc 

510 520 530 540 550 

tatgattgtt aataagacag ttcacagaat gattttttct aggcaaggac 

560 570 580 590 600 

aaggttatcg tcacatgaac ttgacctcca ccaataaata ttggacaagc 

610 620 630 640 650 

agcaatgaaa cgcagagaaa tgatacggga tgttttggca tcctccaaga 

660 670 680 690 700 

atacaactcc acaaacaatc aaacatgccc tccatctctt aaacctccat 

710 720 730 740 750 

ccctgcccac agtaactccg agcattcact ctacaaatac tcaaattaat 

760 770 780 790 800 

actgctaaat ctggaactat gaacccaagt agcgacgatg aggaccttat 

810 820 830 840 850 

gatttccggc tcaggatctg gagaacaggg gccccacaca actcttaatg 
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860 870 880 890 900 

tagtcactga acagaaacaa tcgtcaacaa tattgtccac tccttcacta> 

910 920 930 940 950 

catccaagca cctcacaaca tgagcaaaac agtacgaatc cttcccgaca 

960 970 980 990 1000 

tgctgtaact gagcacaatg gaaccgaccc aacaacacaa ccagcaacgc 

1010 1020 1030 1040 1050 

tcctcaacaa tactaataca actcccacct ataacactct caagtacaac 

1060 1070 1080 1090 1100 

ctcagtactc cttcccctcc aacccgcaac atcaccaata atgatacaca 

1110 1120 1130 1140 1150 

acgtgaacta gcagaaagcg aacaaaccaa tgctcagttg aacacaactc 

1160 1170 1180 1190 1200 

tagatccaac agaaaatccc accacaggac aagacaccaa cagcacaacc 

1210 1220 1230 1240 1250 

aacatcatca tgacgacatc agatataaca agcaaacacc ccacaaattc 

1260 1270 1280 1290 1300 

ttctccggat tctagtccga caacccgccc tcctatatac tttagaaaga 

1310 1320 1330 1340 1350 

aacgatcgat cctctggagg gaaggcgaca tgttcccttt tctggatggg 

1360 1370 1380 1390 1400 

ttaataaatg ctccaattga ttttgaccca gttccaaata caaaaacaat 

1410 1420 1430 1440 1450 

ctttgatgaa tcctctagtt ctggtgcctc ggctgaggaa gatcaacatg 

1460 1470 1480 1490 1500 

cctcccccaa tattagttta actttatctt attttcctaa tataaatgag 

1510 1520 1530 1540 1550 

aacactgcct actctggaga aaatgagaat gattgtgatg cagagttaag 

1560 1570 1580 1590 1600 

aatttggagc gttcaggagg atgacctggc cgcagggctc agttggatac 

1610 1620 1630 1640 1650 

cgttttttgg ccctggaatt gaaggacttt acactgctgt tttaattaaa> 

1660 1670 1680 1690 1700 

aatcaaaaca atttggtctg caggttgagg cgtctagcca atcaaactgc 

1710 1720 1730 1740 1750 

caaatccttg gaactcttat tgagagtcac aactgaggaa agaacattct 

1760 1770 1780 1790 1800 
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ccttaatcaa tagacatgct attgactttc tactcacaag atggggagga 

1810 1820 1830 1840 1850 

acatgcaaag tgcttggacc tgattgttgc atcgggatag aagacttgtc 

1860 1870 1880 1890 1900 

caaaaatatt tcagagcaaa ttgaccaaat taaaaaggac gaacaaaaag 

1910 1920 1930 1940 1950 

aggggactgg ttggggtctg ggtggtaaat ggtggacatc cgactggggt 

1960 1970 1980 1990 2000 

gttcttacta acttgggcat tttgctacta ttatccatag ctgtcttgat 

2010 2020 2030 2040 2050 

tgctctatcc tgtatttgtc gtatctttac taaatatatc ggataa 



SEQ ID NO: 8 

RVNGPl /MUSGP2 amino acid sequence 



MetLysThrlleTyrPheLeuIleSerLeuIleLeuIleGlnSerlleLys 

ThrLeuProValLeuGluIleAlaSerAsnSerGlnProGlnAspVal 

AspSerValCysSerGlyThrLeuGlnLysThrGl\iAspValHisLeuMet 

GlyPheThrLeuSerGlyGlnLysValAlaAspSerProLeuGluAlaSer 

LysArgTrpAlaPheArgThrGlyValProProLysAsnValGluTyr 

ThrGluGlyGluGluAlaLysThrCysTyrAsnlleSerValThrAspPro 

SerGlyLysSerLeuLeuLeuAspProProSerAsnlleArgAspTyrPro 

LysCysLysThrValHisHisIleGlnGlyGlnAsnProHisAlaGln 

GlylleAlaLeuHisLeuTrpGlyAlaPhePheLeuTyrAspArgValAla 

SerThrThrMetTyrArgGlyLysValPheThrGluGlyAsnlleAlaAla 

Me 1 1 1 e Va lAsnLys Thr Va 1 H i s Ar gMe tllePheSerArgGlnGly 

GlnGlyTyrArgHisMetAsnLeuThrSerThrAsnLysTyrTrpThrSer 

SerAsnGluThrGlnArgAsnAspThrGlyCysPheGlylleLeuGlnGlu 

TyrAsnSerThrAsnAsnGlnThrCysProProSerLeuLysProPro 

SerLeuProThrValThrProSerlleHisSerThrAsnThrGlnlleAsn 

ThrAlaLysSerGlyThrMetAsnProSerSerAspAspGluAspLeuMet 

IleSerGlySerGlySerGlyGluGlnGlyProHisThrThrLeuAsn 

ValValThrGluGlnLysGlnSerSerThrlleLeuSerThrProSerLeu 

HisProSerThrSerGlnHisGluGlnAsnSerThrAsnProSerArgHis 

AlaValThrGluHisAsnGlyThrAspProThrThrGlnProAlaThr 
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LeuLeuAsri^snThrAsnThrThrProThrTyrAsnThrLeuLysTyrAsn 

LeuSerThrProSerProProThrArgAsnlleThrAsnAsnAspThrGln 

ArgGluLeuAlaGluSerGluGlnThrAsnAlaGlnLeuAsnThrThr 

LeuAspProThrGluAsnProThrThrGlyGlnAspThrAsnSerThrThr 

AsnllelleMetThrThrSerAspIleThrSerLysHisProThrAsnSer 

SerProAspSerSerProThrThrArgProProIleTyrPheArgLys 

LysArgSerlleLeuTrpArgGluGlyAspMetPheProPheLeuAspGly 

LeuIleAsnAlaProIleAspPheAspProValProAsnThrLysThrlle 

PheAspGluSerSerSerSerGlyAlaSerAlaGluGluAspGlnHis 

AlaSerProAsnlleSerLeuThrLeuSerTyrPheProAsnlleAsnGlu 

AsnThrAlaTyrSerGlyGluAsnGluAsnAspCysAspAlaGluLeuArg 

IleTrpSerValGlnGluAspAspLeuAlaAlaGlyLeuSerTrpIle 

ProPhePheGlyProGlylleGluGlyLeuTyrThrAlaValLeuIleLys 

AsnGlnAsnAsnLeuValCysArgLeuArgArgLeuAlaAsnGlnThrAla 

LysSerLeuGluLeuLeuLeuArgValThrThrGluGluArgThrPhe 

SerLeuIleAsnArgHisAlalleAspPheLeuLeuThrArgTrpGlyGly 

ThrCysLysValLeuGlyProAspCysCysIleGlylleGluAspLeuSer 

LysAsnlleSerGluGlnlleAspGlnlleLysLysAspGluGlnLys 

GluGlyThrGlyTrpGlyLeuGlyGlyLysTrpTrpThrSerAspTrpGly 

ValLeuThrAsnLeuGlylleLeuLeuLeuLeuSerlleAlaValLeuIle 

AlaLeuSerCysIleCysArgllePheThrLysTyrlleGly* 

SEQ ID NO: 9 

MBGGP1 /MBGGP2 nucleotide sequence 



10 20 30 40 50 

aagcttaaca tgaagaccac atgtttcctt atcagtctta tcttaattca 

60 70 80 90 100 

agggacaaaa aatctcccca ttttagagat agctagtaat aatcaacccc 

110 120 130 140 150 

aaaatgtgga ttcggtatgc tccggaactc tccagaagac agaagacgtc 

160 170 180 190 200 

catctgatgg gattcacact gagtgggcaa aaagttgctg attccccttt 

210 220 230 240 250 
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ggaggcatcc aagcgatggg ctttcaggac aggtgtacct cccaagaatg 

260 270 280 290 300 

ttgagtacac agagggggag gaagccaaaa catgctacaa tataagtgta 

310 320 330 340 350 

acggatccct ctggaaaatc cttgctgtta gatcctccta ccaacatccg 

360 370 380 390 400 

tgactatccg aaatgcaaaa ctatccatca tattcaaggt caaaaccctc 

410 420 430 440 450 

atgcacaggg gatcgccctt catttatggg gagcattttt tctgtatgat 

460 470 480 490 500 

cgcattatgt accgaggcaa agtcttcact gaagggaaca tagcagctat 

510 520 530 540 550 

gattgtcaat aagacagtgc acaaaatgat tttctcgcgg caaggacaag 

560 570 580 590 600 

ggtaccgtca tatgaatctg acttctacta ataaatattg gacaagtagt 

610 620 630 640 650 

aacggaacgc aaacgaatga cactggatgt ttcggcgctc ttcaagaata 

660 670 680 690 700 

caattctaca aagaaccaaa catgtgctcc gtccaaaata cctccaccac 

710 720 730 740 750 

tgcccacagc ccgtccggag atcaaactca caagcacccc aactgatgcc 

760 770 780 790 800 

accaaactca ataccacgga cccaagcagt gatgatgagg acctcgcaac 

810 820 830 840 850 

atccggctca gggtccggag aacgagaacc ccacacaact tctgatgcgg 

860 870 880 890 900 

tcaccaagca agggctttca tcaacaatgc cacccactcc ctcaccacaa 

910 920 930 940 950 

ccaagcacgc cacagcaagg aggaaacaac acaaaccatt cccaagatgc 

960 970 980 990 1000 

tgtgactgaa ctagacaaaa ataacacaac tgcacaaccg tccatgcccc 

1010 1020 1030 1040 1050 

ctcataacac taccacaatc tctactaaca acacctccaa acacaacttc 

1060 1070 1080 1090 1100 

agcactctct ctgcaccatt acaaaacacc accaatgaca acacacagag 

1110 1120 1130 1140 1150 
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cacaatcact gaaaatgagc aaaccagtgc cccctcgata acaaccctgc 

1160 1170 1180 1190 1200 

ctccaacggg aaatcccacc acagcaaaga gcaccagcag caaaaaaggc 

1210 1220 1230 1240 1250 

cccgccacaa cggcaccaaa cacgacaaat gagcatttca ccagtcctcc 

1260 1270 1280 1290 1300 

ccccaccccc agctcgactg cacaacatct tgtatatttc agaagaaagc 

1310 1320 1330 1340 1350 

gatcgatcct ctggagggaa ggcgacatgt tcccttttct ggatgggtta 

1360 1370 1380 1390 1400 

ataaatgctc caattgattt tgacccagtt ccaaatacaa aaacaatctt 

1410 1420 1430 1440 1450 

tgatgaatcc tctagttctg gtgcctcggc tgaggaagat caacatgcct 

1460 1470 1480 1490 1500 

cccccaatat tagtttaact ttatcttatt ttcctaatat aaatgagaac 

1510 1520 1530 1540 1550 

actgcctact ctggagaaaa tgagaatgat tgtgatgcag agttaagaat 

1560 1570 1580 1590 1600 

ttggagcgtt caggaggatg acctggccgc agggctcagt tggataccgt 

1610 1620 1630 1640 1650 

tttttggccc tggaattgaa ggactttaca ctgctgtttt aattaaaaat 

1660 1670 1680 1690 1700 

caaaacaatt tggtctgcag gttgaggcgt ctagccaatc aaactgccaa 

1710 1720 1730 1740 1750 

atccttggaa ctcttattga gagtcacaac tgaggaaaga acattctcct 

1760 1770 1780 1790 1800 

taatcaatag acatgctatt gactttctac tcacaagatg gggaggaaca 

1810 1820 1830 1840 1850 

tgcaaagtgc ttggacctga ttgttgcatc gggatagaag acttgtccaa 

1860 1870 1880 1890 1900 

aaatatttca gagcaaattg accaaattaa aaaggacgaa caaaaagagg 

1910 1920 1930 1940 1950 

ggactggttg gggtctgggt ggtaaatggt ggacatccga ctggggtgtt 

1960 1970 1980 1990 2000 

cttactaact tgggcatttt gctactatta tccatagctg tcttgattgc 

2010 2020 2030 2040 2050 
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tctatcctgt atttgtcgta tctttactaa atatatcgga taacggaatt 
c 

SEQ ID NO: 10 

MBGGPl /MBGGP2 amino acid sequence 

MetLysThrThrCysPheLeuIleSerLeuIleLeuIleGlnGlyThr 

LysAsnLeuProIleLeuGluIleAlaSerAsnAsnGlnProGln 

AsnValAspSerValCysSerGlyThrLeuGlnLysThrGluAspVal 

HisLeuMetGlyPheThrLeuSerGlyGlnLysValAlaAspSerProLeu 

GluAlaSerLysArgTrpAlaPheArgThrGlyValProProLysAsn 

ValGluTyrThrGluGlyGluGluAlaLysThrCysTyrAsnlleSerVal 

ThrAspProSerGlyLysSerLeuLeuLeuAspProProThrAsnlleArg 

AspTyrProLysCysLysThrlleHisHisIleGlnGlyGlnAsnPro 

HisAlaGlnGlylleAlaLeuHisLeuTrpGlyAlaPhePheLeuTyrAsp 

ArglleAlaSerThrThrMetTyrArgGlyLysValPheThrGluGlyAsn 

IleAlaAlaMetlleValAsnLysThrValHisLysMetllePheSer 

ArgGlnGlyGlnGlyTyrArgHisMetAsnLeuThrSerThrAsnLysTyr 

TrpThrSerSerAsnGlyThrGlnThrAsnAspThrGlyCysPheGlyAla 

LeuGlnGluTyrAsnSerThrLysAsnGlnThrCysAlaProSerLys 

IleProProProLeuProThrAlaArgProGluIleLysLeuThrSerThr 

ProThrAspAlaThrLysLeuAsnThrThrAspProSerSerAspAspGlu 

AspLeuAlaThrSerGlySerGlySerGlyGluArgGluProHisThr 

ThrSerAspAlaValThrLysGlnGlyLeuSerSerThrMetProProThr 

ProSerProGlnProSerThrProGlnGlnGlyGlyAsnAsnThrAsnHis 

SerGlnAspAlaValThrGluLeuAspLysAsnAsnThrThrAlaGln 

ProSerMetProProHisAsnThrThrThrlleSerThrAsnAsnThrSer 

LysHisAsnPheSerThrLeuSerAlaProLeuGlnAsnThrThrAsnAsp 

AsnThrGlnSerThrlleThrGluAsnGluGlnThrSerAlaProSer 
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IleThrThrLeuProProThrGlyAsnProThrThrAlaLysSerThrSer 

SerLysLysGlyProAlaThrThrAlaProAsnThrThrAsnGluHisPhe 

ThrSerProProProThrProSerSerThrAlaGlnHisLeuValTyr 

PheArgArgLysArgSerlleLeuTrpArgGluGlyAspMetPheProPhe 

LeuAspGlyLeuIleAsnAlaProIleAspPheAspProValProAsnThr 

LysThrllePheAspGluSerSerSerSerGlyAlaSerAlaGluGlu 

AspGlnHisAlaSerProAsnlleSerLeuThrLeuSerTyrPheProAsn 

IleAsnGluAsnThrAlaTyrSerGlyGluAsnGluAsnAspCysAspAla 

GluLeuArglleTrpSerValGlnGluAspAspLeuAlaAlaGlyLeu 

SerTrpIleProPhePheGlyProGlylleGluGlyLeuTyrThrAlaVal 

LeuIleLysAsnGlnAsnAsnLeuValCysArgLeuArgArgLeuAlaAsn 

GlnThrAlaLysSerLeuGluLeuLeuLeuArgValThrThrGluGlu 

ArgThrPheSerLeuIleAsnArgHisAlalleAspPheLeuLeuThrArg 

TrpGlyGlyThrCysLysValLeuGlyProAspCysCysIleGlylleGlu 

AspLeuSerLysAsnlleSerGluGlnlleAspGlnlleLysLysAsp 

GluGlnLysGluGlyThrGlyTrpGlyLeuGlyGlyLysTrpTrpThrSer 

AspTrpGlyValLeuThrAsnLeuGlylleLeuLeuLeuLeuSerlleAla 

ValLeuIleAlaLeuSerCysIleCysArgllePheThrLysTyrlle 

Gly* 

SEQ ID NO: 11 

EBOGP1 / EBOGP2 nucleotide sequence 

10 20 30 40 50 

atgggcgtta caggaatatt gcagttacct cgtgatcgat tcaagaggac 

60 70 80 90 100 

atcattcttt ctttgggtaa ttatcctttt ccaaagaaca ttttccatcc 

110 120 130 140 150 

cacttggagt catccacaat agcacattac aggttagtga tgtcgacaaa 



160 170 180 190 200 

ctagtttgtc gtgacaaact gtcatccaca aatcaattga gatcagttgg 
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210 220 230 240 250 

actgaatctc gaagggaatg gagtggcaac tgacgtgcca tctgcaacta 

260 270 280 290 300 

aaagatgggg cttcaggtcc ggtgtcccac caaaggtggt caattatgaa 

310 320 330 340 350 

gctggtgaat gggctgaaaa ctgctacaat cttgaaatca aaaaacctga 

360 370 380 390 400 

cgggagtgag tgtctaccag cagcgccaga cgggattcgg ggcttccccc 

410 420 430 440 450 

ggtgccggta tgtgcacaaa gtatcaggaa cgggaccgtg tgccggagac 

460 470 480 490 500 

tttgccttcc ataaagaggg tgctttcttc ctgtatgatc gacttgcttc 

510 520 530 540 550 

cacagttatc taccgaggaa cgactttcgc tgaaggtgtc gttgcatttc 

560 570 580 590 600 

tgatactgcc ccaagctaag aaggacttct tcagctcaca ccccttgaga 

610 620 630 640 650 

gagccggtca atgcaacgga ggacccgtct agtggctact attctaccac 

660 670 680 690 700 

aattagatat caggctaccg gttttggaac caatgagaca gagtacttgt 

710 720 730 740 750 

tcgaggttga caatttgacc tacgtccaac ttgaatcaag attcacacca 

760 770 780 790 800 

cagtttctgc tccagctgaa tgagacaata tatacaagtg ggaaaaggag 

810 820 830 840 850 

caataccacg ggaaaactaa tttggaaggt caaccccgaa attgatacaa 

860 870 880 890 900 

caatcgggga gtgggccttc tgggaaacta aaaaaaacct cactagaaaa 

910 920 930 940 950 

attcgcagtg aagagttgtc tttcacagtt gtatcaaacg gagccaaaaa 

960 970 980 990 1000 

catcagtggt cagagtccgg cgcgaacttc ttccgaccca gggaccaaca 

1010 1020 1030 1040 1050 

caacaactga agaccacaaa atcatggctt cagaaaattc ctctgcaatg 

1060 1070 1080 1090 1100 

gttcaagtgc acagtcaagg aagggaagct gcagtgtcgc atctaacaac 
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1110 1120 1130 1140 1150 

ccttgccaca atctccacga gtccccaatc cctcacaacc aaaccaggtc 

1160 1170 1180 1190 1200 

cggacaacag cacccataat acacccgtgt ataaacttga catctctgag 

1210 1220 1230 1240 1250 

gcaactcaag ttgaacaaca tcaccgcaga acagacaacg acagcacagc 

1260 1270 1280 1290 1300 

ctccgacact ccctctgcca cgaccgcagc cggaccccca aaagcagaga 

1310 1320 1330 1340 1350 

acaccaacac gagcaagagc actgacttcc tggaccccgc caccacaaca 

1360 1370 1380 1390 1400 

agtccccaaa accacagcga gaccgctggc aacaacaaca ctcatcacca 

1410 1420 1430 1440 1450 

agataccgga gaagagagtg ccagcagcgg gaagctaggc ttaattacca 

1460 1470 1480 1490 1500 

atactattgc tggagtcgca ggactgatca caggcgggag aagaactcga 

1510 1520 1530 1540 1550 

cgatcggcaa ttgtcaatgc tcaacccaaa tgcaacccta atttacatta 

1560 1570 1580 1590 1600 

ctggactact caggatgaag gtgctgcaat cggactggcc tggataccat 

1610 1620 1630 1640 1650 

atttcgggcc agcagccgag ggaatttaca tagaggggct aatgcacaat 

1660 1670 1680 1690 1700 

caagatggtt taatctgtgg gttgagacag ctggccaacg agacgactca 

1710 1720 1730 1740 1750 

agctcttcaa ctgttcctga gagccacaac tgagctacgc accttttcaa 

1760 1770 1780 1790 1800 

tcctcaaccg taaggcaatt gatttcttgc tgcagcgatg gggcggcaca 

1810 1820 1830 1840 1850 

tgccacattc tgggaccgga ctgctgtatc gaaccacatg attggaccaa 

1860 1870 1880 1890 1900 

gaacataaca gacaaaattg atcagattat tcatgatttt gttgataaaa 

1910 1920 1930 1940 1950 

cccttccgga ccagggggac aatgacaatt ggtggacagg atggagacaa 

1960 1970 1980 1990 2000 

tggataccgg caggtattgg agttacaggc gttataattg cagttatcgc 
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# 



2010 2020 2030 

tttattctgt atatgcaaat ttgtctttta gttgaattc 

SEQ ID NO: 12 

EBOGPl / EBOGP2 amino acid sequence 

MetGlyValThrGlylleLeuGlnLeuProArgAspArgPheLys 

ArgThrSerPhePheLeuTrpValllelleLeuPheGlnArgThrPhe 

SerlleProLeuGlyVallleHisAsnSerThrLeuGlnValSerAsp 

ValAspLysLeuValCysArgAspLysLeuSerSerThrAsnGlnLeuArg 

SerValGlyLeuAsnLeuGluGlyAsnGlyValAlaThrAspValProSer 

AlaThrLysArgTrpGlyPheArgSerGlyValProProLysValVal 

AsnTyrGluAlaGlyGluTrpAlaGluAsnCysTyrAsnLeuGluIleLys 

LysProAspGlySerGluCysLeuProAlaAlaProAspGlylleArgGly 

Phe Pr oAr gCy s Ar gTy rVa lHi sLy s Va 1 S e rG lyThrG ly Pr oCy s 

AlaGlyAspPheAlaPheHisLysGluGlyAlaPhePheLeuTyrAspArg 

LeuAlaSerThrVallleTyrArgGlyThrThrPheAlaGluGlyValVal 

AlaPheLeuIleLeuProGlnAlaLysLysAspPhePheSerSerHis 

ProLeuArgGluProValAsnAlaThrGluAspProSerSerGlyTyrTyr 

SerThrThrlleArgTyrGlnAlaThrGlyPheGlyThrAsnGluThrGlu 

TyrLeuPheGluValAspAsnLeuThrTyrValGlnLeuGluSerArg 

PheThrProGlnPheLeuLeuGlnLeuAsnGluThrlleTyrThrSerGly 

LysArgSerAsnThrThrGlyLysLeuIleTrpLysValAsnProGluIle 

AspThrThrlleGlyGluTrpAlaPheTrpGluThrLysLysAsnLeu 

ThrArgLysIleArgSerGluGluLeuSerPheThrValValSerAsnGly 

AlaLysAsnlleSerGlyGlnSerProAlaArgThrSerSerAspProGly 

ThrAsnThrThrThrGluAspHisLysIleMetAlaSerGluAsnSer 

SerAlaMetValGlnValHisSerGlnGlyArgGluAlaAlaValSerHis 

LeuThrThrLeuAlaThrlleSerThrSerProGlnSerLeuThrThrLys 

ProGlyProAspAsnSerThrHisAsnThrProValTyrLysLeuAsp 
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IleSerGluAlaThrGlnValGluGlnHisHisArgArgThrAspAsnAsp 

SerThrAlaSerAspThrProSerAlaThrThrAlaAlaGlyProProLys 

AlaGluAsnThrAsnThrSerLysSerThrAspPheLeuAspProAla 

ThrThrThrSerProGlnAsnHisSerGluThrAlaGlyAsnAsnAsnThr 

HisHisGlnAspThrGlyGluGluSerAlaSerSerGlyLysLeuGlyLeu> 

IleThrAsnThrlleAlaGlyValAlaGlyLeuIleThrGlyGlyArg 

ArgThrArgArgSerAlalleValAsnAlaGlnProLysCysAsnProAsn 

LeuHisTyrTrpThrThrGlnAspGluGlyAlaAlalleGlyLeuAlaTrp 

IleProTyrPheGlyProAlaAlaGluGlylleTyrlleGluGlyLeu 

MetHisAsnGlnAspGlyLeuIleCysGlyLeuArgGlnLeuAlaAsnGlu 

ThrThrGlnAlaLeuGlnLeuPheLeiiArgAlaThrThrGluLeuArgThr 

PheSerlleLeiiAsnArgLysAlalleAspPheLeuLeuGlnArgTrp 

GlyGlyThrCysHi s I leLeuGlyProAspCysCys I leGluProHi sAsp 

TrpThrLysAsnlleThrAspLysIleAspGlnllelleHisAspPheVal 

AspLysThrLeuProAspGlnGlyAspAsnAspAsnTrpTrpThrGly 

TrpArgGlnTrpIleProAlaGlylleGlyValThrGlyValllelleAla 

Va 1 1 leAlaLeuPheCy s 1 1 eCy sLy s PheVal Phe * 

SEQ ID NO: 13 

RVNGP1/RVNGP2 nucleotide sequence 



10 


20 


30 


40 


50 


atgaagacca 


tatattttct 


gattagtctc 


attttaatcc 


aaagtataaa 


60 


70 


80 


90 


100 


aactctccct 


gttttagaaa 


ttgctagtaa 


cagccaacct 


caagatgtag 


110 


120 


130 


140 


150 


attcagtgtg 


ctccggaacc 


ctccaaaaga 


cagaagatgt 


tcatctgatg 


160 


170 


180 


190 


200 


ggatttacac 


tgagtgggca 


aaaagttgct 


gattcccctt 


tggaagcatc 


210 


220 


230 


240 


250 


taaacgatgg 


gctttcagga 


caggtgttcc 


tcccaagaac 


gttgagtata 
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260 270 280 290 300 

cggaaggaga agaagccaaa acatgttaca atataagtgt aacagaccct 

310 320 330 ' 340 350 

tctggaaaat ccttgctgct ggatcctccc agtaatatcc gcgattaccc 

360 370 380 390 400 

taaatgtaaa actgttcatc atattcaagg tcaaaaccct catgcacagg 

410 420 430 440 450 

ggattgccct ccatttgtgg ggggcatttt tcttgtatga tcgcgttgcc 

460 470 480 490 500 

tctacaacaa tgtaccgagg caaggtcttc actgaaggaa atatagcagc 

510 520 530 540 550 

tatgattgtt aataagacag ttcacagaat gattttttct aggcaaggac 

560 570 580 590 600 

aaggttatcg tcacatgaac ttgacctcca ccaataaata ttggacaagc 

610 620 630 640 650 

agcaatgaaa cgcagagaaa tgatacggga tgttttggca tcctccaaga 

660 670 680 690 700 

atacaactcc acaaacaatc aaacatgccc tccatctctt aaacctccat 

710 720 730 740 750 

ccctgcccac agtaactccg agcattcact ctacaaatac tcaaattaat 

760 770 780 790 800 

actgctaaat ctggaactat gaacccaagt agcgacgatg aggaccttat 

810 820 830 840 850 

gatttccggc tcaggatctg gagaacaggg gccccacaca actcttaatg 

860 870 880 890 900 

tagtcactga acagaaacaa tcgtcaacaa tattgtccac tccttcacta 

910 920 930 940 950 

catccaagca cctcacaaca tgagcaaaac agtacgaatc cttcccgaca 

960 970 980 990 1000 

tgctgtaact gagcacaatg gaaccgaccc aacaacacaa ccagcaacgc 

1010 1020 1030 1040 1050 

tcctcaacaa tactaataca actcccacct ataacactct caagtacaac 

1060 1070 1080 1090 1100 

ctcagtactc cttcccctcc aacccgcaac atcaccaata atgatacaca 

1110 1120 1130 1140 1150 

acgtgaacta gcagaaagcg aacaaaccaa tgctcagttg aacacaactc 
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1160 1170 1180 1190 1200 

tagatccaac agaaaatccc accacaggac aagacaccaa cagcacaacc 

1210 1220 1230 1240 1250 

aacatcatca tgacgacatc agatataaca agcaaacacc ccacaaattc 

1260 1270 1280 1290 1300 

ttctccggat tctagtccga caacccgccc tcctatatac tttagaaaga 

1310 1320 1330 1340 1350 

aacgatcgat tttctggaaa gaaggtgata tattcccgtt tttagatggg 

1360 1370 1380 1390 1400 

ttaataaata ctgaaattga ttttgatcca atcccaaaca cagaaacaat 

1410 1420 1430 1440 1450 

ctttgatgaa tctcccagct ttaatacttc aactaatgag gaacaacaca 

1460 1470 q 1480 1490 1500 

ctcccccgaa tatcagttta actttctctt attttcctga taaaaatgga 

1510 1520 1530 1540 1550 

gatactgcct actctgggga aaacgagaat gattgtgatg cagagttgag 

1560 1570 1580 1590 1600 

gatttggagt gtgcaggagg acgatttggc ggcagggctt agctggatac 

1610 1620 1630 1640 1650 

cattttttgg ccctggaatc gaaggactct atactgccgg tttaatcaaa 

1660 1670 1680 1690 1700 

aatcagaaca atttagtttg taggttgagg cgcttagcta atcaaactgc 

1710 1720 1730 1740 1750 

taaatccttg gagctcttgt taagggtcac aaccgaggaa aggacatttt 

1760 1770 1780 1790 1800 

ccttaatcaa taggcatgca attgactttt tgcttacgag gtggggcgga 

1810 1820 1830 1840 1850 

acatgcaagg tgctaggacc tgattgttgc ataggaatag aagatctatc 

1860 1870 1880 1890 1900 

taaaaatatc tcagaacaaa tcgacaaaat cagaaaggat gaacaaaagg 

1910 1920 1930 1940 1950 

aggaaactgg ctggggtcta ggtggcaaat ggtggacatc tgactggggt 

1960 1970 1980 1990 2000 

gttctcacca atttgggcat cctgctacta ttatctatag ctgttctgat 

2010 2020 2030 2040 

tgctctgtcc tgtatctgtc gtatcttcac taaatacatt ggatga 
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SEQ ID NO: 14 

RVNGP1/RVNGP2 amino acid sequence 

MetLysThrlleTyrPheLeuIleSerLeuIleLeuIleGlnSerlleLys 

ThrLeuProValLeuGluIleAlaSerAsnSerGlnProGlnAspVal 

AspSerValCysSerGlyThrLeuGlnLysThrGluAspValHisLeuMet 

GlyPheThrLeuSerGlyGlnLysValAlaAspSerProLeuGluAlaSer 

LysArgTrpAlaPheArgThrGlyValProProLysAsnValGluTyr 

ThrGluGlyGluGluAlaLysThrCysTyrAsnlleSerValThrAspPro 

SerGlyLysSerLeuLeuLeuAspProProSerAsnlleArgAspTyrPro 

LysCysLysThrValHisHisIleGlnGlyGlnAsnProHisAlaGln 

GlylleAlaLeuHisLeuTrpGlyAlaPhePheLeuTyrAspArgValAla 

SerThrThrMetTyrArgGlyLysValPheThrGluGlyAsnlleAlaAla 

MetlleValAsnLysThrValHisArgMetllePheSerArgGlnGly 

GlnGlyTyrArgHisMetAsnLeuThrSerThrAsnLysTyrTrpThrSer 

SerAsnGluThrGlnArgAsnAspThrGlyCysPheGlylleLeuGlnGlu 

TyrAsnSerThrAsnAsnGlnThrCysProProSerLeuLysProPro 

SerLeuProThrValThrProSerlleHisSerThrAsnThrGlnlleAsn 

ThrAlaLysSerGlyThrMetAsnProSerSerAspAspGluAspLeuMet 

IleSerGlySerGlySerGlyGluGlnGlyProHisThrThrLeuAsn 

ValValThrGluGlnLysGlnSerSerThrlleLeuSerThrProSerLeu 

HisProSerThrSerGlnHisGluGlnAsnSerThrAsnProSerArgHis 

AlaValThrGluHisAsnGlyThrAspProThrThrGlnProAlaThr 

LeuLeuAsnAsnThrAsnThrThrProThrTyrAsnThrLeuLysTyrAsn 

LeuSerThrProSerProProThrArgAsnlleThrAsnAsnAspThrGln 

ArgGluLeuAlaGluSerGluGlnThrAsnAlaGlnLeuAsnThrThr 

LeuAspProThrGluAsnProThrThrGlyGlnAspThrAsnSerThrThr 

AsnllelleMetThrThrSerAspIleThrSerLysHisProThrAsnSer 

SerProAspSerSerProThrThrArgProProIleTyrPheArgLys 

LysArgSerllePheTrpLysGluGlyAspIlePheProPheLeuAspGly 

LeuIleAsnThrGluIleAspPheAspProIleProAsnThrGluThrlle 

PheAspGluSerProSerPheAsnThrSerThrAsnGluGluGlnHis 

ThrProProAsnlleSerLeuThrPheSerTyrPheProAspLysAsnGly 
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AspThrAlaTyrSerGlyGluAsnGluAsnAspCysAspAlaGluLeuArg 

IleTrpSerValGlnGluAspAspLeuAlaAlaGlyLeuSerTrpIle 

ProPhePheGlyProGlylleGluGlyLeuTyrThrAlaGlyLeuIleLys 

AsnGlnAsnAsnLeuValCysArgLeiiArgArgLeuAlaAsnGlnThrAla 

LysSerLeuGluLeuLeuLeuArgValThrThrGluGluArgThrPhe 

SerLeuIleAsnArgHisAlalleAspPheLeuLeuThrArgTrpGlyGly 

ThrCysLysValLeuGlyProAspCysCysIleGlylleGluAspLeuSer 

LysAsnlleSerGluGlnlleAspLysIleArgLysAspGluGlnLys 

GluGluThrGlyTrpGlyLeuGlyGlyLysTrpTrpThrSerAspTrpGly 

ValLeuThrAsnLeuGlylleLeuLeuLeuLeuSerlleAlaValLeuIle 

AlaLeuSerCysIleCysArgllePheThrLysTyrlleGly* 

SEQ ID NO: 15 

GP1 forward (containing Hindlll) EBOV (5' -CAA GCT TCA ATG GGC 
GTT ACA-3 ' ) , 

SEQ ID NO: 16 

MBGV (5' -AAG CTT AAC ATG AAG ACC ACA T-3 ' ) ; 
SEQ ID NO: 17 

GP2 forward (containing Pvul) EBOV (5' -GAC GAT CGG CAA TTG 
TCA ATG -3 ' ) , 

SEQ ID NO: 18 

MBGV (5'-AGC GAT CGA TCC TCT GGA G -3 ' ) ; 
SEQ ID NO: 19 

GP1 reverse (containing Pvul) EBOV (5'-GCC GAT CGT CGA GTT 
CTT CT-3 ' ) , 

SEQ ID NO: 20 

MBGV (5 '-GAT CGA TCG CTT TCT TCT G-3'); 
SEQ ID NO: 21 

GP2 reverse (containing EcoRI) EBOV (5'-TGA ATT CAA CTA AAA 
GAC AAA TTT G-3 ' ) , 

SEQ ID NO: 22 

MBGV (5' -CGA ATT CCG TTA TCC GAT ATA T-3') 
SEQ ID NO: 23 

GP1 forward (containing Hindlll) MBGV MUS (5' -AAG CTT AAC ATG 
AAG ACC ACA T-3 ' ) 

SEQ ID NO: 24 

MBGV RVN (5 -AAG CTT CGA CAT GAA GAC CAT AT- 3 ' ) ; 



* 
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• 



SEQ ID NO: 25 

GP2 forward (containing Pvul) MBGV MUS (5'-AGC GAT CGA TCC 
TCT GGA G -3 ' ) , 

SEQ ID NO: 26 

MBGV RVN (5'-AAC GAT CGA TTT TCT GGA A-3 ' ) ; 
SEQ ID NO: 27 

GP1 reverse (containing Pvul) MBGV MUS (5' -GAT CGA TCG CTT 
TCT TCT G-3 ' ) , 

SEQ ID NO: 28 

MBGV RVN (5' -AAA TCG ATC GTT TCT TTC TAA AG- 3 ' ) ; 
SEQ ID NO: 29 

GP2 reverse (containing EcoRI) MBGV MUS (5' -CGA ATT CCG TTA 
TCC GAT ATA T-3 ' ) , 

SEQ ID NO: 30 

MBGV RVN (5 '-CGA ATT CTG TCA TCC AAT GTA T-3') . 



